The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Outlier detection is an interesting data mining task, which detects rare events. This paper focuses on the method of outlier detection based on frequent pattern (FP method for short). First we analyze the drawback of this method, and then an improved method (LFP method for short) has been presented. Finally, we evaluate the two methods by using several datasets and the experiment results show that...
Many of the previous studies show convincing arguments that mining frequent subgraphs is especially useful. Many hidden frequent patterns which are very interesting can not be found by mining single graph. Previous studies as Quasi-Clique have little success with the hub problem. In this paper, we introduce a new conception Correlated-Quasi-Clique and develop a novel algorithm, CoClique, to address...
Text classification is the key technology for topic tracking, and vector space model (VSM) is one of the most simple and effective model for topics representation. On the basis of VSM and support vector machines (SVM), we have studied how feature space dimension in VSM as well as linearly separable and non-separable SVM affect topic tracking. Then we get the variation law that they affect topic tracking,...
The information on the Internet has been grown exponentially, the Internet users are overwhelmed by these information. How to automatically extract useful information from the relevant pages, so as to provide a convenient and rapid information query platform for the users, is an important issue. In this paper, based on simple tree matching algorithm, we present a Web data extraction method based on...
This paper presents an approach for synthetic aperture radar (SAR) target recognition with data fusion. The data of multi-aspect images of a target are fused by principal component analysis (PCA) or discrete wavelet transform (DWT) after preprocessing. Wavelet domain PCA is used to extract feature vectors from the fused data. Support vector machine (SVM) is applied to classify the extracted feature...
Web-scale relation extraction is crucial to building the Web people search engines. Previous extraction models, such as Snowball, focus only on single type extraction, while the real applications always require as many as possible types of relation. In this paper, we propose a novel Web-scale relation extraction framework Multi-Type Snowball (MultiSnowball). MultiSnowball targets at extracting multiple...
This article choosing all the 3987 documents' citation as data sample of Knowledge Discovery(KD), which was published in Web of Science(SCI-EXPANDED, SSCI, A&HCI) from 1986 to2009, confirming the hot research topics and the research fronts by using word frequency analysis and detect key words that their term frequency changed notably, and drawing the knowledge mapping of them by using Citespace:...
Artificial neural network (ANN) shows good nonlinear mapping ability in many applications compared to traditional algorithms. In many applications, it is now widely used to extract knowledge from the train neural network. The fact that the model obtained with neural network is not understandable in terms of black box model is a brake to their use in this field. To enhance the explanation of ANN, a...
The common bus on bus control is RS232. With the development of field bus technology, field bus technology is widely used in data recorder. How to real time record the data of field bus in the experiment and provide the test data for process reconstruction is a very important problem. This paper introduces a kind of data acquisition and storage based on field bus. Its core design is C8051F040 microprocessor...
Outlier mining is an important branch of data mining and has attracted much attention recently. The density-based method LOF is widely used in application. However, the complexity of the method is quadratic to size of the dataset, and it is very sensitive to its parameters MinPts. In this paper, we propose a new outlier detection method based on Voronoi diagram, called Voronoi based Outlier Detection...
With the rapid development of deep web, high quality data pre-processing and extraction are extremely essential from these web data sources. The clustering is a crucial step for the data processing. This paper presents a unified solution to tackle the issue of clustering e-business web contents. Firstly, the vocabulary are segmented based on the obtained web contents, and then perform statistically...
This paper combines rough set method and evidential reasoning method, using evidential reasoning approach as a comprehensive settlement on the one-sided defect of rough set data mining method and using rough set method as a comprehensive settlement on the subjective defect of evidential reasoning method. Thus we construct a scientific and rational firm evaluation model. Firstly, model uses the attribute...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.